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Abstract: Model-based analyses have become an integral part of modern metabolic 
engineering and systems biology in order to gain knowledge about complex and not 
directly observable cellular processes. For quantitative analyses, not only experimental 
data, but also measurement errors, play a crucial role. The total measurement error of any 
analytical protocol is the result of an accumulation of single errors introduced by several 
processing steps. Here, we present a framework for the quantification of intracellular 
metabolites, including error propagation during metabolome sample processing. Focusing 
on one specific protocol, we comprehensively investigate all currently known and 
accessible factors that ultimately impact the accuracy of intracellular metabolite 
concentration data. All intermediate steps are modeled, and their uncertainty with respect 
to the final concentration data is rigorously quantified. Finally, on the basis of a 
comprehensive metabolome dataset of Corynebacterium glutamicum, an integrated error 
propagation analysis for all parts of the model is conducted, and the most critical steps for 
intracellular metabolite quantification are detected. 
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Abbreviation 



Input variable 


Symbol 


Cell dry weight 


C CDW 


Cell dry weight specific biovolume 


V CDW 


Volume-specific biovolume 


v brs 


Bioreactor sample volume 


V brs 


Cytosolic volume 


Vcyt 


Volume of the extraction reagent 


Vexc 


Total extraction volume 


Vexc 


Residual quenching volume after cell separation 


Vyes 


Volume of the quenching reagent 


Vque 


Metabolite concentration of the standard 


c std 


Metabolite concentration in the extract 


c exc 


Cytosolic metabolite concentration 


c cyt 


Metabolite concentration in the quenching supernatant 


c qsn 


Metabolite concentration in the culture broth 


c cub 


Leakage concentration 


c lea 


Peak area quotient of respective sample type 


v \...\ 


12 C peak area of respective sample type 


Vnc.l...] 


13 C peak area of respective sample type 


77l3C,r...l 



1. Introduction 

Quantitative "omics" technologies, such as metabolomics, play a key role in driving systems 
biology towards becoming an applied science for metabolic engineering and synthetic biology of 
microorganisms. Providing quantitative data of the cell's transcriptome, proteome, metabolome and 
fluxome provides the opportunity to unravel piecewise the complex regulatory mechanisms underlying 
all in vivo metabolic processes. Moreover, it is intuitively obvious that the integration of such multi- 
omics data into a mathematical model is a prerequisite for extracting hidden information on regulatory 
effects constituting the cellular metabolism [1]. This is especially true of all approaches aiming for 
mechanistic descriptions of metabolic processes, such as transcription, translation and enzyme catalysis [2]. 

However, a meaningful interpretation of multi-omics data depends to a great extent on the quality of 
the data itself. Here, it should be remembered that the term "accuracy" refers to both the "trueness" 
and "precision" of a measurement (Figure 1). While "trueness" defines the systematic deviation of an 
obtained measurement value from the required "true" value, the term "precision" is linked to the 
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reproducibility of an analytical protocol, i.e., the sum of randomly distributed errors due to handling 
steps around the actual measurement. Both terms are equivalent to "bias" and "variance" in statistics. 

Figure 1. Schematic view of the terms "trueness" and "precision" of a measurement. 




High trueness High trueness Low trueness 
High precision Low precision High precision 



Clearly, from a scientific point of view, "trueness" is most important, because measurements in 
scientific experiments should create an exact image of reality in order to be useful for generating and 
testing hypotheses. In the field of biotechnology, and especially when dealing with the analysis of 
complex biological samples of specific cultivation experiments, this is a rather difficult task. Here, 
many factors exist that are related to the technical (e.g., bioreactor device) and biological (e.g., cellular 
growth) part of an experiment, and only some of them are accessible. 

This contribution concentrates on metabolomics, which generally attempts to study metabolites and 
their concentrations, interactions and dynamics within complex samples [3]. The variability of 
chemical properties, as well as diverse concentration ranges (from picomole to millimole) within the 
metabolome, poses great challenges for intracellular metabolite quantification, and various protocols 
for sample preparation are available [4]. 

In order to obtain accurate intracellular concentration data, repetitive experiments, including all 
sample processing steps (biological replicates), are necessary. Unfortunately, this is either too 
laborious and time-consuming or simply not possible for biological reasons. Hence, these protocols are 
usually only repeated in selected parts (applying "technical replicates") to obtain at least some rough 
estimate of data accuracy. 

Thus, our major objective is to derive an analysis tool for handling error propagation during 
metabolome sample processing. This is done by formulating a process model for the complete 
workflow for the quantification of intracellular metabolites in biomass-containing samples from 
cultivation experiments. By focusing on one specific protocol, we comprehensively investigated all 
currently known and accessible factors that ultimately determine the accuracy of intracellular 
metabolite concentration data. We demonstrate that the resulting model allows a trustworthy 
estimation of intracellular concentration data and is a viable basis for the further improvement of any 
sample processing protocol. 
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2. Results and Discussion 

2.1. Sample Processing for Quantitative Intracellular Metabolomics 

Figure 2 gives an overview of our standard sample processing protocol for intracellular metabolome 
analysis. In the following, we will briefly describe all individual steps, focusing on the most relevant 
error sources. This description then provides the theoretical background for the formulation of a 
process model covering the whole procedure, ranging from biomass determination to mass 
spectrometry analytics. 

Figure 2. Scheme of different sample processing steps for intracellular metabolome 
analysis. In each step, a number of influencing variables are involved that are all subject to 
systematic and random-based errors. Hence, in order to maximize the overall accuracy of 
intracellular metabolite data, variables exerting the greatest influence have to be identified 
and the respective errors minimized. Abbreviations: QSN, quenching supernatant sample; 
CUB, culture broth sample; EXC, extract sample. 




2.1.1. Biomass Determination 



Biomass determination is usually not considered to be a step in sample preparation for metabolome 
analysis. However, it significantly affects the final results by defining the: 

• appropriate volume for the metabolome sample, 

• reference value for the resulting intracellular metabolite concentrations. 

In particular, it has to be kept in mind that the only relevant biomass for intracellular metabolite 
quantification is given by the sum of all cells with intact membranes containing the "biovolume" [5]. 
However, in many studies it is not the actual measured biovolume, but rather a rough estimate based 
on the measured cell dry weight and a literature value for the specific cell volume that serves as the 
reference value for the absolute amount of intracellular metabolites [6]. 
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2.1.2. Metabolome Sampling 

The main requirements for a sampling technique suitable for metabolome analysis are: 

• exact withdrawal of the pre-defined sample volume, 

• no time delay between sampling and inactivation of metabolism (quenching), 

• no sample contamination. 

Apart from manual sampling using syringes [7], a large number of different automatic sampling 
devices have been developed in the past few decades [8]. These are either based on active pumping 
systems or on valve controls driven by overpressure or underpressure. 

The resulting sample volume is inherently subject to a systematic deviation from the set-point 
value, depending on the specific sampling device. In the case of valve controls, this deviation is mainly 
a function of the valve opening time and the actual pressure inside the bioreactor, which can be quite 
high. In contrast, pumping systems allow the withdrawal of precise sample volumes. However, 
depending on the pipe lengths, the time delay between sampling and quenching, as well as the dead 
volume leading to sample contamination, is comparatively high. 

2.1.3. Quenching 

The ultimate challenge in the quantification of intracellular metabolites is their low chemical 
stability, as well as fast enzymatic conversion by the metabolic reactions under investigation. In recent 
years, a variety of different quenching protocols has been proposed [9], and quenching with the help of 
cold methanol solutions has become the standard procedure for bacteria. In this case, the sample is 
directly transferred into a defined volume of quenching solution consisting of 50-100% methanol with 
varying supplements, which is cooled down in the range of -20 °C to -80 °C [10-12]. 

Nowadays, it is apparent that microorganisms can lose significant amounts of intracellular 
metabolites during quenching, but it is still an open question whether the shift of temperature, the 
osmotic pressure, the influence of the organic solvent or a certain combination of all factors leads to 
the underlying change in cell membrane integrity [13]. Clearly, the leakage effect can falsify the image 
of the metabolic state of a cell dramatically. 

Hence, an appropriate quenching procedure should allow 

• fast inactivation of the cell's metabolism in a state which is as close as possible to the in vivo 
state during cultivation, 

• correction of metabolite loss as a result of leakage. 

In order to quantify and, if necessary, correct for metabolite leakage, the resulting quenching 
supernatant after the subsequent cell separation step is analyzed (cf. Figure 2, QSN sample in addition 
to the EXC sample). In addition, a cell-free sample of the culture broth is taken (cf. Figure 2, CUB 
sample) to account for the amounts of metabolites already present in the culture media 
during sampling [14]. By taking this extended version of the classical quenching approach, three error- 
prone concentration measurements for each metabolite are necessary to calculate the final 
intracellular concentration. 



Metabolites 2012, 2 



1017 



2.1.4. Cell Separation 

The separation of the quenched cells from the extracellular medium depends on the criteria of 

• fast separation to minimize the dwell time of cells in the quenching solution, 

• complete separation with minimal physical energy input. 

In general, cell separation is carried out via centrifugation or filtration. In both cases, the minimal 
separation time is a compromise to fulfill both criteria [7], and therefore, a significant effect of leakage 
per se cannot be prevented. 

Irrespective of the method applied, a certain amount of extracellular volume is present between the 
cells (intercellularly), attached to vessel walls or in the membrane pores (dead volume). Therefore, the 
sample always becomes contaminated, either by increasing or diluting the resulting amount of 
intracellular metabolites obtained after extraction. 

2.1.5. Cell Disruption and Extraction 

In the next step, the intracellular metabolites have to be extracted from the cytosol. For this task, an 
integrated method combining cell disruption and immediate metabolite extraction is usually applied 
that should primarily ensure 

• complete cell disruption, 

• extraction of the complete amount of all metabolites of interest, 

• no degradation or chemical modification of metabolites, 

• compatibility of all solvents with subsequent analytical techniques. 

On the one hand, depending on the organism and its cell wall characteristics, different methods for 
cell disruption have been developed covering physical (temperature, ultrasonic, milling, etc.), chemical 
(chloroform, etc.) or biological (enzymatic digestion) treatments [15]. However, in most cases, and 
depending on the degree of cell disruption, the mixture of the whole cytosol (biovolume) with the 
extraction reagent is incomplete. 

On the other hand, a great number of extraction reagents have proven suitable for 
prokaryotes [11,12,16]. However, all methods suffer from their more or less undefined specificity and, 
hence, extraction efficiency for certain metabolites [15]. 

2.1.6. Analysis 

In the last step of the sample processing protocol, the intracellular metabolites are quantified by 
applying a measurement method that provides 

• high sensitivity and selectivity, 

• wide linear dynamic range and broad analytical spectrum, 

• quantification via available standards. 

Due to its comparatively good performance with respect to all the above-mentioned criteria, the LC- 
ESI-MS/MS technique has been favored for quantitative metabolomics in recent years [17]. 
Nevertheless, it is also apparent that, depending on the complexity (coeluents) and chemical properties 
(pH, polarity and salt concentration) of the biological matrix, ion suppression can be a serious 
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problem, especially for ESI-MS based methods [18]. Consequently, the resulting peak intensity or area 
is not directly proportional to the amount of a certain metabolite to be quantified. 

2.2. Modeling the Metabolome Sample Processing 

The experimental protocol described in the previous sections finally results in peak area 
measurements of respective metabolites in the different process samples. In the following, a reference 
model for the metabolome sample processing is introduced, and the steps required converting the 
measured peak areas into intracellular metabolite concentrations are described, proceeding in reverse 
order through the experimental workflow. This reference model will later be refined by correcting for 
systematic errors that are known and accessible during sample processing. 

2.2.1. Estimation of Extract Concentrations 

In the first step, the measured peak area for a specific metabolite is transformed into the 
corresponding concentration value related to the total extraction volume of the extract sample. 
Following the classical external calibration approach, samples with defined metabolite concentrations 
c i,std are measured, and a linear model for the resulting standard peak areas rj i std is formulated: 

Vi,sta = Ci,std -m + n (1) 

The model parameters comprising slope m and intercept n are estimated by, for example, applying 
weighted linear least-squares regression utilizing the variances of the standard measurements as 
weights [19]. 

By transforming the model in Equation (1), the extract concentration c exc can be estimated from the 
measured peak area r] exc of the metabolite in the extract sample: 

_ Vexc ~ n 

Cexc ~ m (2) 



2.2.2. Estimation of Cytosolic Concentrations 

In order to obtain intracellular concentration data related to the cytosolic volume of extracted cells, 
the estimated extract concentration from Equation (2) has to be further converted by taking into 
account the dilution during biomass extraction. 

Usually, the cytosolic volume V cyt is estimated from the measured cell dry weight c CDW in the 

bioreactor sample of a volume V brs defined a priori and the specific biovolume v CDW taken from the 
literature: 

V C yt = C CDW ' Vbrs ' V CDW (3) 

In that case, V cyt does not reflect the actual biovolume, because the value of v CDW is not constant, 

but is highly dependent on the state of cultivation during metabolome sampling. 

The total extraction volume V e ' xc is then estimated from the cytosolic volume V cyt of disrupted cells 

and the volume of the extraction reagent V exc defined a priori: 

Vexc = V cyt + V exc (4) 
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Finally, the cytosolic concentration c cyt can be estimated as: 

_ ^exc _ Vexc ~ Tl ^exc \ 

Ccyt ~ Cexc ' 77~ ~ ~ I 1 + " ~H ~ J (5) 

v cyt m \ C CDW v brs V CDW> 

At this point, we obtained an intracellular concentration value that is most likely far from the true in 
vivo value, due to some uncertainties (e.g., matrix effects, metabolite leakage, etc.) that are not covered 
by the reference model introduced so far. 

2.3. Correction for Systematic Errors 

In general, the experimental workflow as described above involves process quantities and related 
systematic errors (biases) that are: 

• independent of the actual experiment, e.g., the pre-defined sample volume V brs (cf. Equation 
(3)): in this case, the bias from a certain set-point value can be determined in a separate 
experiment (Table 1), 

• dependent on the actual experiment, e.g., the final intracellular concentration c cyt 
(cf. Equation (5)): hence, for bias correction, additional measurements or internal standards 
are needed. 

With respect to the second case, the reference model described above will be refined in the 
following by correcting for all accessible biases that are inherent in the sample processing protocol. 

2.3.1. Matrix Effects 

Currently, the method of choice to correct for analytical biases as a result of matrix effects (e.g., ion 
suppression) is isotope dilution mass spectrometry (IDMS) [20]. In short, a prepared mixture 
containing the uniformly 13 C-labeled analogs of all target metabolites is spiked into each metabolome 
sample (including standards) before LC-MS/MS measurement and is analyzed in parallel. This allows 
the estimation of peak area quotients v of the respective 12 C and 13 C analytes: 

Vl2C 

Assuming that both analytes, i.e., 12 C and 13 C, show identical behavior regarding the underlying 
matrix effects, the quotient v of the resulting peak areas automatically corrects for this kind of bias and 
Equation (1) and (2) can be reformulated as: 

v i,std = c i,std ' 171 + n 

_ Vexc ~ n (7) 
c exc ~ _ 

m 

It should be noted that IDMS only provides trustworthy results if 12 C analyte and its 1 C analogon 
are ionized in an identical manner and, hence, are subject to the same ion suppression. This assumption 
is only valid in the linear measurement range where the ion source is not saturated with any kind of 
ionizable substance [21]. 
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2.3.2. Measurement of Biovolume 

Since the actual biovolume of the biomass in the quenching sample is the relevant reference volume 
for the amount of intracellular metabolites, a direct and experiment-specific measurement is highly 
desirable. An appropriate Coulter counter method, for example, can be used to measure the biovolume 
v brs , and the cytosolic volume is more accurately estimated by: 

Vcyt = Vbrs ' v brs (8) 

2.3.3. Incomplete Cell Separation 

Due to incomplete cell separation (subsequent to centrifugation or filtration), there is always some 
quenching supernatant left in the biomass sample prior to the extraction step. First, this leads to an 
increase in the total extraction volume already introduced in Equation (4): 

Vexc ~ ^cyt "F V exc V res (9) 

Here, the term V res denotes the residual quenching volume after cell separation, causing an 
additional dilution of the resulting extract concentration. 

Second, the metabolites present in the quenching supernatant lead to a false-to-high estimation of 
the corresponding extract concentration c exc when applying Equation (7). In order to correct for this 
bias, the metabolite concentration in the quenching supernatant c qsn is estimated according to Equation 

(7). Considering the dilution by the extract volume leads to: 

^exc 1 V res 
C exc = "~ c qsn ' 77} (10) 

111 v exc 

It should be noted that the obtained extract concentration might still be significantly biased due to 
the metabolite-specific extraction efficiency, which depends on the degree of cell disruption and the 
properties of the extraction reagent. In principle, this bias could be corrected by using an appropriate 
internal standard. However, no such standard is currently available, and therefore, the impact of 
incomplete extraction is not included in our model. 

2.3.4. Metabolite Leakage 

As a final correction step, the potential metabolite loss due to leakage is considered. The proposed 
procedure relies on the additional determination of the culture broth concentration, c cub , which is 
estimated according to Equation (7). Together with the estimates already introduced for the extract and 
quenching supernatant concentrations (cf. Equation 10), and accounting for the respective sample 
dilutions, the leakage concentration can be estimated as: 

_ Vbrs "F Vque ~ Vcyt V brs 

c lea ~ c qsn ' 7} c cub ' 77 (1 1) 

v cyt v cyt 

It should be noted that by formulating the mass balance of Equation (1 1), it is assumed that leakage 
is a result of metabolite loss via the cell membrane, e.g., by enforced diffusion, and not a consequence 
of cell damage, including loss of cytosolic volume. 
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Finally, modifying Equation (5) by taking the leakage concentration into consideration yields a 
more accurate estimate for the cytosolic metabolite concentration: 
V 

v exc 

c cyt ~ c exc ' 7} ' c lea 
v cyt 

1 / 

= ^TiT — 77-, v exc ' ( Vbrs ' Vbrs + Vexc + Vres ^ (12) 

' v brs ' ^brs 

"1" Vqsn ' (Ybrs Vque ~ ^brs ' ^brs) ~ v cub ' ^brs ~ n ' (Xexc Vque ^res)^ 



2.4. Application Example 

The final model for metabolome data processing includes the basic equations for raw-data 
conversion, as well as the extensions for the correction of systematic errors (cf. Equation 12). In the 
following example, the process model is applied to a dataset from a cultivation experiment with a 
Corynebacterium glutamicum strain (see Experimental Section for more details). The dataset contained 
44 measurements for metabolites from the central metabolism in three different samples (extract, 
culture broth and quenching supernatant) for one state of cultivation (mid-exponential phase). 

2.4.1. Linearity Check 

As a preliminary step, the data was filtered according to the linear measurement ranges resulting 
from the external calibrations with the appropriate standard quantifiers (Figure 3). The linearity check 
for this dataset leads to the following results: 

• 17 metabolites can be further processed without any restriction. 

• For five metabolites, the upper linear measurement range is violated and further sample 
dilution is necessary. 

• In the case of 22 metabolites, the lower linear measurement range is violated, indicating that 
the respective datasets cannot be further processed. 

On the basis of the 17 metabolites validated for further processing, the effect of each single bias 
correction step as discussed above was investigated. 

2.4.2. Effect of Bias Correction 

Starting with the reference model without any bias correction (cf. Equation 5), the change in the 
estimated absolute value for each metabolite concentration was simulated by successively considering 
all correction steps (cf. Eqations 6-12). The simulation results are shown in Figure 4, and it can be 
seen that: 

• For 15 of the 17 metabolites, the use of the internal standard (IDMS) leads to an increase of 
the intracellular metabolite concentration. 

• As expected, the actual measurement of the specific biovolume (v brs ) leads to a smaller total 
cytosolic volume and, hence, results in an increase of the intracellular metabolite 
concentration. 
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• Interestingly, for nearly all metabolites, the consideration of the residual quenching volume 
after cell separation (V res ) leads to only small changes in the intracellular concentration 
values. This can be easily explained by the opposing effect of metabolite dilution and 
carryover, as discussed previously in connection with Equation 9 and 10. 
Finally, it can be clearly demonstrated that of all the processing steps, leakage correction leads to 
the most significant change in the intracellular concentration value. More specifically, without leakage 
correction, the intracellular metabolite concentrations are always estimated false-to-low. 



Figure 3. Linearity check for the resulting peak area quotients of metabolites in the three 
process samples of the dataset. The vertical black lines symbolize the lower and upper 
limits of the linear measurement range. The three symbols represent the peak area quotients 
for the extract (triangle), the quenching supernatant (circle) and the culture broth (square) 
sample. Green and red symbols indicate that the underlying measurement is inside or 
outside the linear range, respectively. Accordingly, all metabolites are arranged in three 
sections: (A) The dataset can be directly used for further processing; (B) Some samples 
need further dilution and re -measurement; (C) At least one peak area quotient is below the 
lower bound, and hence, the metabolite cannot be quantified with the applied protocol. 
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Figure 4. Effect of each bias correction step on the intracellular metabolite concentration, 
(a) reference model; (b) matrix effects; (c) measurement of biovolume; (d) incomplete cell 
separation causing metabolite dilution; (e) incomplete cell separation causing metabolite 
dilution and carryover; (f) metabolite leakage. 
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Level of correction 

The refined process model allows the estimation of intracellular metabolite concentrations 
reflecting the true in vivo state of the cell's metabolome in the best way currently possible when 
applying our specific protocol. 

2.4.3. Propagation of Random Errors 

This model is then applied to estimate the precision of the resulting concentration data by 
integrating the random errors of all process quantities (input variables) following an error propagation 
analysis. Due to the non-linearity of the model, a non-linear error propagation method should also be 
preferred. Therefore, we decided to use a classical bootstrap method [22]. Briefly, bootstrapping is a 
re-sampling method and can be understood as generating multiple biological replicates in silico, 
instead of performing the same experiments in the lab. By assuming that the random errors of all input 
variables are normally distributed with known mean and variance, the process model is applied several 
times for a set of input variables that is randomly added with noise. The propagation of these errors 
through the model then results in a probability distribution for the output variable (intracellular 
concentration), allowing us to derive their mean and variance. 

Corresponding variances for all input variables were estimated in well-defined experiments (see 
Experimental Section for more details) and shown in Table 1 . The resulting errors are in the range of 
0.78% (quenching volume V que ) to 19.05% (residual quenching volume after cell separation V res ). 
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Table 1. Input variables of the metabolome sample processing protocol. Systematic (bias) 
and random (variance) errors were determined independently for each variable. 
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In Table 2, the estimated intracellular metabolite concentrations from the reference model are 
compared to the concentration data obtained when applying the refined model, including bias 
correction and error propagation analysis. It can be easily seen that both datasets are significantly 
different, thus emphasizing the necessity of accurate data processing during metabolome analysis. 

Table 2. Estimated intracellular metabolite concentrations when applying two different 
approaches of metabolome data processing. 
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(Equation 5) (Equation 12) 
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Table 2. Cont. 




Intracellular metabolite concentration [uM] 


Metabolite 


Reference model 


Refined model 




(Equation 5) 


(Equation 12) 


VAL 


113.07 


1742.05 ±91.80 


ASP 


234.97 


3903.36 ±213.42 


HSE 


94.60 


1721.95 ± 167.81 


THR 


25.37 


862.88 ±41.37 


LEU 


14.50 


157.39 ±9.10 


PRO 


322.62 


9532.64 ±497.28 



2.4.4. Sensitivity Analysis 

Finally, we addressed the question of the extent to which each input variable, including its inherent 
uncertainty, contributes to the variance of each intracellular metabolite concentration. For this purpose, 
a sensitivity analysis was performed with respect to the variances in the input variables, where each 
single variance was independently reduced to about 50% of the experimentally estimated value. 

The simulation results are shown in Figure 5, with the example of three metabolites from different 
parts of the central metabolism. It can be seen that the input variables with the greatest influence are 

12 1 3 

the C and C peak area measurements, resulting from the standard and quenching supernatant 
samples, followed by the measurement of the biovolume. 

Consequently, on the basis of this protocol, the precision of intracellular metabolite data can be 
further increased by increasing the precision of the LC-MS/MS, as well as the biovolume 
measurements. Clearly, the precision of both measurements can be improved by increasing the 
sensitivity of the underlying detector devices at the back end of the measurement method, i.e., mass 
spectrometer and Coulter counter, respectively. In case of the LC-MS/MS analytics, the matrix effects, 
although their systematic effect is corrected by applying IDMS, can still have a great impact on the 
measurement precision. Here, it is conceivable that a further optimization of the liquid chromatography 
part with respect to a better separation of all analytes will further decrease these matrix interferences. 

Figure 5. Sensitivity of the variance of each input variable to the variance of the 
intracellular metabolite concentration. The decrease of every single variance value to about 
50% of the experimentally estimated variance leads to a reduction of the final variance 
represented in the bars. 
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3. Experimental Section 

3.1. Strain and Media 

In this study, the L-lysine producer strain C. glutamicum DM1800 [24] was cultivated in defined 
glucose medium CGXII [25] containing per liter of distilled water: 20 g (NH 4 ) 2 S0 4 , lg K2HPO4, 1 g 
KH2PO4, 5 g urea, 10 g D-glucose, 13.25 mg CaCl 2 *2H 2 0, 0.25 g MgS0 4 *7H 2 0, 1 mg FeS0 4 *7H 2 0, 
1 mg MnS0 4 *H 2 0, 0.02 mg NiCl 2 *6H 2 0, 0.313 mg CuS0 4 *5H 2 0 and 1 mg ZnS0 4 *7H 2 0. The 
medium was adjusted to a pH of 7.0 with sodium hydroxide. The medium additionally contained 3 mL 
of 10% (v v" 1 ) AF 204 (Sigma) and 1 mL of a 0.2 g L -1 biotin stock solution, which were added after 
sterilization. Cryocultures were stored at -80°C in CGXII medium containing 20% (v v -1 ) glycerol. 

3.2. Cultivation Conditions 

Batch cultivation was performed in a 1.5 1 bioreactor (DASGIP AG, Julich, Germany) with a 
working volume of 1 1. Cells were directly inoculated with 2 mL of cryoculture without performing 
any pre-culture. The air flow (1 vvm) and temperature (30 °C) were kept constant. The pH was 
maintained at 7.0 by adding 4 M NaOH and 4 M HC1. Aerobic process conditions (dissolved oxygen > 
30%) were ensured via stirrer speed control (200-1200 rpm). During cultivation, the optical density, 
glucose concentration, cell dry weight, cell count and cell size were measured offline. Dissolved 
oxygen (Visiferm DO 225, Hamilton Bonaduz AG, Bonaduz, Switzerland), pH (405-DPAS-SC- 
K80/225, Mettler Toledo GmbH, Giefien, Germany) and exhaust gas concentrations of carbon dioxide 
and oxygen (GA4, DASGIP AG, Jiilich, Germany) were measured online. 

3.3. Sampling and Sample Processing 

For culture broth analysis, a cell suspension volume of 2 mL was drawn into a 5 mL plastic syringe 
and then dropped to withdraw the dead volume of the sample port. Then 3 mL was drawn into a fresh 5 
mL plastic syringe. 1 mL was centrifuged (60 s, 13.000 g, Biofuge pico, Thermo Fisher Scientific, 
Waltham, USA), and the supernatant was squeezed through a sterile filter (polyvinylidene fluoride, 0.2 
um pore size, Dia-Nielsen, Diiren, Germany). From this filtrate, 250 uL was transferred to 750 uL of 
-20 °C 60% (v v" 1 ) methanol for LC-MS/MS analysis and 50 uL was used for glucose analysis. 

For the analysis of intracellular metabolites, 6 mL of culture was drawn into a 10 mL plastic syringe 
and placed in a beaker. Then, 5 mL of the sample was transferred to a reaction tube containing 15 mL 
of -60 °C 60% methanol. The sample and the quenching solution were mixed by vigorous shaking. 
Afterwards, the quenched cells and the quenching supernatant were separated via centrifugation 
(10 min, 9,500 g, -20 °C, Avanti 30, Beckman Coulter, Krefeld, Germany), the supernatant was 
decanted, filtrated (0.22 um cellulose acetate, Dia-Nielsen) and stored at -20 °C until analysis. The 
pellet was resolved in -20 °C methanol and 4 °C TE buffer (10 mM Tris, 1 mM EDTA, pH 7.0). The 
volume of the extraction media (TE buffer and methanol) was dependent on the biovolume in the 
pellet, since the extraction of metabolites from 1 uL of biovolume requires 10 uL of methanol and 
10 uL of TE buffer. After the addition of an equal volume of -20 °C chloroform, the sample was 
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incubated for 2 h at -20 °C in a Labquake shaker (Reax 2, Heidolph, Schwabach, Germany). All 
samples were stored at -20 °C until analysis. 

For the measurement of optical density, the cell dry weight, cell count and biovolume full samples 
were taken by drawing the required volume into a plastic syringe after withdrawal of the dead volume. 

3.4. Offline Analysis 

Optical density (OD 6 oo) was measured at 600 nm (PharmaSpec UV 1700, Shimadzu, Duisburg, 
Germany) against 0.9% (w v -1 ) NaCl. Glucose was measured using an enzymatic analysis system 
(EBIO compact, Eppendorf AG Hamburg, Germany). Cell count and cell size were measured using a 
Coulter counter equipped with a 45 um capillary (CASY® 1 Modell TT, Roche Diagnostics, 
Mannheim, Germany). Cell dry weight was measured gravimetrically by placing 2 mL of a full sample 
in a reaction tube of known dry weight. The cells were separated by centrifugation (2 min, 13.000 g, 
Biofuge pico, Haereus), dried for 48 h at 80 °C and chilled to room temperature in a desiccator 
(Neubert-Glas, Geschwenda, Germany). 

3.5. Metabolome Analysis 

Culture supernatants and cell extracts were measured by HPLC (X-LC 3000 Series, Jasco, Tokyo, 
Japan) coupled to a mass spectrometer (API 4000, ABSciex, Framingham, USA) equipped with a 
Turbolon spray source. For the analysis of intermediates of the central metabolism, a CI 8 column 
(Synergi Hydro, Phenomenex, Aschaffenburg, Germany) was used with eluent A (10 mM 
tributylamine aqueous solution adjusted pH to 4.95 with 15 mM acetic acid) and eluent B (methanol) 
at a temperature of 60 °C. The elution gradient was as follows: 2 min (100% A), 5 min (80% A), 8 min 
(80% A), 10 min (65%), 14 min (0% A), 15 min (0% A), 15.5 min (100% A) and 17 min (100% A). 
Amino acids were analyzed by applying an ion exchange column (Luna SCX, Phenomenex) with 
eluent A (5% acetic acid) and eluent B (15 mM ammonium acetate aqueous solution adjusted pH to 
6.0 with 100% acetic acid) at a temperature of 40°C. The elution gradient was as follows: 10 min (85% 
A), 17 min (0% A), 25 min (0% A) and 29 min (85%). In both cases, the flow rate was 0.45 mL min" 1 
and the injection volume 10 |xl. For details regarding MS operation see [26,27]. Both methods were 

1 3 

used with a C-labeled internal standard applying the IDMS method [20]. The internal standard was 
produced by performing a batch cultivation of E. coli with uniformly 13 C-labeled glucose as the sole 
carbon source. 

3.6. Estimation of Systematic and Random Errors 

The errors of all volumetric process quantities (sample, quenching and extraction volume) were 
determined gravimetrically by aliquoting the pre-defined volumes 20 times in reaction tubes of known 
weight. The aliquotation was done using the same instruments (pipet and syringe) used for the 
metabolome analysis. From the weight differences, mean values and variances were estimated to 
correct for biases and permit error propagation analysis. 

To estimate the precision of the LC-MS/MS measurement as a function of the matrix complexity, 
all relevant sample types (standard, extract, quenching supernatant and culture supernatant sample) were 
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measured in five replicates. Here, glucose-6-phosphate, fructose-6-phosphate, phosphoenolpyruvate, 
pyruvate, acetyl-CoA, citric acid, cis-aconitate, 2-oxoglutaric acid, succinic acid and N-acetyl serine 
were chosen as examples of metabolites. The variances of the integrated peak areas were determined 

12 13 

for each C mass isotope of the analyte, as well as the corresponding C mass isotope of the internal 
standard. The extract, quenching and culture supernatant samples were taken from a cultivation of 
C. glutamicum DM1800. 

The variances of cell count, biovolume and cell dry weight measurements were determined by 
20-fold replicate measurement of cell samples taken from a cultivation of C. glutamicum DM1800. 

4. Conclusions 

For all measurements, in particular quantitative ones, a quantitative indicator of the quality of the 
result is required to assess its reliability. This is not only vital for subsequent interpretation and 
modeling purposes, but also to allow a reliable comparison of the results. In our work, we detailed the 
metabolome data processing workflow according to a specific protocol. 

A model is formulated that enables the quantitative analysis of metabolome data processing. The 
accuracy of the concentration measurements is studied with respect to uncertainties in the input 
variables. The model allows us to identify the most critical process steps and is therefore suitable for 
designing a more robust metabolome data processing pipeline. 

Moreover, our model-based approach is general in the sense that it can be readily transferred to 
alternative processing workflows to evaluate and express the inherent uncertainties of the basic 
building blocks as well as the overall uncertainty. 
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